Spelling Correction as a Foreign Language
Authors
Abstract
In this paper, we reformulate the spelling correction problem as a machine translation task under the encoder-decoder framework. This reformulation lets us use a single model for a problem that is traditionally formulated as learning a language model and an error model. The model employs multi-layer recurrent neural networks as the encoder and the decoder. We demonstrate its effectiveness on an internal dataset whose training data is obtained automatically from user logs. The model offers performance competitive with state-of-the-art methods, yet requires neither feature engineering nor hand tuning between models.
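To illustrate what treating correction as translation might look like at the data level, here is a minimal sketch in Python. It frames each (misspelled, corrected) pair as a source and target sequence for an encoder-decoder model. The example pairs, the character-level tokenization, the special symbols, and the helper names `build_vocab` and `encode_pair` are all assumptions made for illustration; the abstract does not specify the tokenization unit, and the internal dataset is not available.

```python
# Hypothetical (misspelled, corrected) pairs; the paper's training data came
# from internal user logs, which are not reproduced here.
PAIRS = [
    ("speling corection", "spelling correction"),
    ("recieve", "receive"),
]

# Special symbols (an assumption; any reserved tokens would do).
PAD, BOS, EOS = "<pad>", "<s>", "</s>"


def build_vocab(pairs):
    """Map every character seen on either side of the pairs to an integer id."""
    chars = sorted({ch for src, tgt in pairs for ch in src + tgt})
    return {sym: i for i, sym in enumerate([PAD, BOS, EOS] + chars)}


def encode_pair(src, tgt, vocab):
    """Return (encoder input, decoder input, decoder target) as lists of ids."""
    src_ids = [vocab[ch] for ch in src]
    tgt_ids = [vocab[ch] for ch in tgt]
    # Teacher forcing: the decoder reads BOS + target and predicts target + EOS.
    return src_ids, [vocab[BOS]] + tgt_ids, tgt_ids + [vocab[EOS]]


vocab = build_vocab(PAIRS)
enc_in, dec_in, dec_out = encode_pair(*PAIRS[1], vocab)
```

Under this framing the misspelled text plays the role of the source language and the corrected text the target language, so a single sequence-to-sequence model can be trained on the pairs without a separate language model and error model.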
Similar resources
Design and implementation of Persian spelling detection and correction system based on Semantic
The Persian language has special features (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, the design and implementation of NLP tools for Persian is more challenging than for other languages (e.g. English or German). Spelling tools are widely used for editing user texts such as emails and text in editors. Also developing Persian tools will provide Persian progr...
HANSpeller: A Unified Framework for Chinese Spelling Correction
Increased interest in China from foreigners has led to a corresponding interest in the study of Chinese. However, non-native speakers encounter many difficulties when learning Chinese, so Chinese spelling check techniques for Chinese as a Foreign Language (CFL) learners are highly desirable. This paper presents our work on the SIGHAN-2015 Chinese Spelling Check task. The task focuses on sp...
Naturalization in Translation: A Case Study on the Translation of English-Indonesian Medical Terms
Naturalization is a translation procedure that is predominantly utilized in the translation of English medical terms into Indonesian. This study focuses on identifying types of naturalization involving the adjustment of spelling and pronunciation and investigating whether naturalization has been appropriately applied based on the rules in the Indonesian general guidance of term formation. The d...
Spelling-based Phonics Instruction: It’s Effect on English Reading and Spelling in an EFL Context
Systematic phonics instruction in first language education has recently received considerable research attention due to its critical role in facilitating phonological awareness and processing skills. However, little is known about the effects of systematic phonics instruction on foreign language reading and spelling in an EFL context. This study examined the effects of spelling-based phonics in...
Pronunciation Modeling in Spelling Correction for Writers of English as a Foreign Language
We propose a method for modeling pronunciation variation in the context of spell checking for non-native writers of English. Spell checkers, typically developed for native speakers, fail to address many of the types of spelling errors peculiar to non-native speakers, especially those errors influenced by differences in phonology. Our model of pronunciation variation is used to extend a pronounc...
Journal title:
- CoRR
Volume: abs/1705.07371
Pages: -
Publication date: 2017